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Abstract 

We show that almost all known lower bound methods for communication complexity are also lower 
bounds for the information complexity. In particular, we define a relaxed version of the partition bound 
of Jain and Klauck [JKIO] and prove that it lower bounds the information complexity of any function. 
Our relaxed partition bound subsumes all norm based methods (e.g. the 72 method) and rectangle-based 
methods (e.g. the rectangle/corruption bound, the smooth rectangle bound, and the discrepancy bound), 
except the partition bound. 

Our result uses a new connection between rectangles and zero-communication protocols where the 
players can either output a value or abort. We prove the following compression lemma: given a protocol 
for a function / with information complexity /, one can construct a zero-communication protocol that 
has non-abort probability at least 2^*^^^^ and that computes / correctly with high probability conditioned 
on not aborting. Then, we show how such a zero-communication protocol relates to the relaxed partition 
bound. 

We use our main theorem to resolve three of the open questions raised by Braverman [Bral2]. First, 
we show that the information complexity of the Vector in Subspace Problem [KRll] is il(n^/^), which, 
in turn, implies that there exists an exponential separation between quantum communication complexity 
and classical information complexity. Moreover, we provide an lower bound on the information 
complexity of the Gap Hamming Distance Problem. 
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1 Introduction 



Information complexity is a way of measuring the amount of information Alice and Bob must reveal to 
each other in order to solve a distributed problem. The importance of this notion has been made apparent in 
recent years through a flurry of results that relate the information complexity of a function and its communi- 
cation complexity. One of the main applications of information complexity is to prove direct sum theorems in 
communication complexity, namely to show that computing k copies of a function costs k times the communi- 
cation of computing a single copy. Chakrabarti, Shi, Wirth and Yao [CSWYOl] used information complexity 
to prove a direct sum theorem for simultaneous messages protocols (their notion is now usually called the 
external information complexity, whereas in this paper we work exclusively with what is often called the 
internal information complexity). Bar-Yossef et al. [BYJKS04], used the infomiation cost in order to prove 
a linear lower bound on the two-way randomized communication complexity of Disjointness. More recently, 
information-theoretic techniques enabled the proof of the first non-trivial direct sum result for general two- 
way randomized communication complexity: the randomized communication complexity of k copies of a 
function / is at least Vk times the randomized communication complexity of / [BBCRIO]. Then, Braverman 
and Rao [BRll], showed a tight relation between the amortized distributional communication complexity of 
a function and its internal information cost. Braverman [Bra 12], defined interactive infomiation complexity, a 
notion which is independent of the prior distribution of the inputs and proved that it is equal to the amortized 
communication complexity of the function. Braverman and Weinstein [BW12] showed that the information 
complexity is lower bounded by discrepancy. 

The main question pertaining to infomiation complexity is its relation to communication complexity. 
On the one hand, the information complexity provides a lower bound on the communication complexity 
of the function, since there cannot be more information leaked than the length of the messages exchanged. 
However, it is still open, whether the information complexity of a function can be much smaller than its 
communication complexity or whether the two notions are basically equivalent. In order to make progress 
towards this question, it is imperative to provide strong lower bounds for information complexity, and more 
specifically to see whether the lower bound methods for communication complexity can be compared to the 
model of information complexity. 

Lower bound methods in communication complexity can be seen to fall into three main categories: the 
norm based methods, such as the 72 method of Linial and Shiaibman [LS09b] (see Lee and Shi^aibman's 
survey for an overview [LS09a]); the rectangle based methods, such as discrepancy and the rectangle bound; 
and, of course, the information theoretic methods, among which, information complexity. Recently, Jain and 
Klauck [JKIO] introduced the smooth rectangle bound, as well as the stronger partition bound, and showed 
that they subsume both 72 and the rectangle bound [JKIO]. 

The first lower bound on information complexity was proved by Braverman [Bral2], who showed that 
it is lower bounded by the logarithm of the communication complexity. Recently, Braverman and Weinstein 
showed that the discrepancy method lower bounds the information complexity [BW12]. Their result follows 
from a compression lemma for protocols: a protocol for a function / that leaks / bits of information implies 
the existence of a protocol with communication complexity 0{I) and advantage on computing / (over a 
random guess) of 2~'-'^^\ 

1.1 Our results 

In this paper, we show that all known lower bound methods for communication complexity, with the 
notable exception of the partition bound, generalize to information complexity. More precisely, we introduce 
the relaxed partition bound (in Definition 3.2) denoted by prt^(/), which depends on the function to be 
computed /, the input distribution /i, and the error parameter e, and such that the distributional communication 
complexity De{f) > Iog(prt^(/)) for any /. We prove that the information complexity of a function / is 
bounded below by the relaxed partition bound: 
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Theorem 1.1. There is a positive constant C such that for all functions f : I ^ Z, all e,6 (0, g], ^^nd all 
distributions fx, we have IC^(/, e) > ^ • (log prt^^3^(/) — log \Z\^ — 6. 

Since we show in Lemma 3.3 that the relaxed partition bound subsumes the norm based methods (e.g. 
the 72 method) and the rectangle-based methods (e.g. the rectangle/corruption bound, the smooth rectangle 
bound, and the discrepancy bound), all of these bounds are also lower bounds on the infomiation complexity. 
Moreover, together with the direct sum theorem for information complexity, our main result implies a direct 
sum theorem on communication complexity for many notable functions (see Corollary 1.4). 

Technique. The key idea of our result is a new connection between communication rectangles and zero- 
communication protocols, where the players can either output a value or abort but without communicating. A 
priori, it is surprising that protocols with no communication can actually provide some insight on the commu- 
nication or information complexity of a function. However, this model, which has been extensively used in 
quantum information for the study of non-local games and Bell inequalities, turns out to be a very powerful 
tool for the study of classical communication and information complexity. The communication complexity 
of simulating distributions is known to be related to the probability of not aborting in zero-communication 
protocols that can abort [GG99, Mas02, BHMR03, BHMR06]. More recently connections have been shown 
for specific lower bound methods. It has been shown that zero-communication protocols with error give 
rise to the factorization norm method [DKLRll], and the connection between the partition bound and zero- 
communication protocols with abort was studied in [LLR12]. 

In a deterministic zero-communication protocol with abort, each of the two players looks at their input and 
decides either to abort the protocol or to output some value z. The output of the protocol is z if both players 
agree on z, or it aborts otherwise. It is easy to see that for any deterministic zero-communication protocol 
with abort, the set of inputs where both players choose to output z forms a rectangle, and so the protocol is 
characterized by a set of rectangles each labeled by an output. In a randomized protocol, we have instead a 
distribution over labeled rectangles. 

This connection between rectangles and zero-communication protocols with abort allows us to obtain 
our lower bound for information complexity from a new compression lemma for protocols (Lemma 3.4): 
a protocol for a function / that leaks / bits of information implies the existence of a zero-communication 
protocol that has non-abort probability at least 2"'^^^^ and that computes / correctly with high probability 
when not aborting. Our main theorem follows from this new compression. 

The technical tools we use are drawn from Bravemian [Bral2] and in particular Bravemian and Weinstein 
[BW12]. We describe the difference between our compression and that of [BW12]. There, they take a protocol 
for computing a function / that has infomiation cost / and compress it to a protocol with communication 
0{I) and advantage of computing / of 2"'^^^^ {i.e. the error increases considerably). Then, they apply the 
discrepancy method, which can handle such small advantage. 

In our compression, we suppress the communication entirely, and, moreover, we only introduce an ai^bi- 
trarily small error since the compressed protocol aborts when it does not beUeve it can correctly compute the 
output. This compression enables us to provide much sharper lower bounds on the information complexity 
and in particular, the lower bound in terms of the relaxed partition bound. 

AppUcations. Our lower bound implies that for most functions for which there exists a lower bound on 
their communication complexity, the same bound extends to their information complexity. Specifically, we 
can apply our lower bound in order to resolve three of the open questions in [Bral2]. 

First, we show that there exists a function /, such that the quantum communication complexity of / is 
exponentially smaller than the infomiation complexity of / (Open Problem 3 in [Bral2]). 

Theorem 1.2. There exists a function f, s.t. for all e G (0, Q{f, e) = 0(log(IC(/, e))). 

In order to prove the above separation, we show that the proof of the lower bound on the randomized 
communication complexity of the Vector in Subspace Problem (VSP) [KRll] provides, in fact, a lower 
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bound on the relaxed partition bound. By our lower bound, this implies that IC(VSP0^n, 1/3) = n^/^ (Open 
Problem 7 in [Bral2]). Since the quantum communication complexity of VSPe,^ is O(logn), we have the 
above theorem. Moreover, this implies an exponential separation between classical and quantum infomiation 
complexity. We refrain from defining quantum information cost in this paper (see [JNIO] for a definition), 
but since the quantum information cost is always smaller than the quantum communication complexity, the 
separation follows trivially from the above theorem. 

In addition, we resolve the question of the information complexity of the Gap Hamming Distance Problem 
(GHD) (Open Problem 6 in [Bral2]), since the lower bounds on the randomized communication complexity 
of this problem go through the rectangle/corruption bound [She 12] or smooth rectangle bound [CRl 1, Vidl2]. 

Theorem 1.3. IC(GHD„, 1/3) = n{n). 

Regarding direct sum theorems, it was shown [Bral2] that the information complexity satisfies a direct 
sum theorem, namely IC^fc(/^',e) > k ■ IC^(/, e). If in addition it holds that D^,{f) = 0(IC^(/, e)), then 

we can immediately deduce that (f) > IC^fe(/'^,e) > k ■ IC^(/, e) > fl{k ■ D^,{f)), i.e. the direct sum 
theorem holds for /. Therefore our main result also gives the following corollary: 

Corollary 1.4. For any e, /i and any f : 1 ^ Z, if De{f) = 0(log prt^(/)), then for all 6 > and integers 
k, it holds that of {f) > Q {k ■ 5'^{D^^r^^{f) - log \Z\) - k5)). 

For example, since Z)^(GHD) < 7i holds trivially, this corollary along with the fact that log p?t^(GHD) = 
Jl(n) ([Shel2, CRl 1, Vidl2], see Section 5.2) immediately implies a direct sum theorem for GHD. 

Finally, regarding the central open question of whether or not it is possible to compress communication 
down to the infomiation complexity for any function, we note that our result says that if one hopes to prove 
a negative result and separate information complexity from communication complexity, then one must use a 
lower bound technique that is stronger than the relaxed partition bound. To the best of our knowledge, the 
only such technique in the literature is the (standard) partition bound. We note, however, that to the best of 
our knowledge there are no known problems whose communication complexity can be lower-bounded by the 
partition bound but not by the relaxed partition bound. 

1.2 Related work 

Definitions of information complexity with some variations extend back to the work on privacy in interac- 
tive protocols [BYCK093], and related definitions in the privacy literature appear [Kla02, FJSIO, ACC''"12]. 
Information complexity as a tool in communication complexity was first used to prove direct sum theorems 
in the simultaneous message model [CSWYOl], and subsequently to prove direct sum theorems and to study 
amortized communication complexity as stated in the first paragraph of this paper [BYJKS04, BBCRIO, 
BRll, Bral2, BW12]. There are many other works using information complexity to prove lower bounds for 
specific functions or to prove direct sum theorems in restricted models of communication complexity, for 
example [JKS03, JRS03, JRS05, HJMR07]. 

In independent and concunent work, Chaki^abarti et al. proved that infomiation complexity is lower 
bounded by the smooth rectangle bound under product distributions [CKW12]. While our result implies the 
result of [CKW12] as a special case, we note that their proof uses entirely different techniques and may be of 
independent interest. 

2 Preliminaries 

2.1 Notation and information theory facts 

Let ;U be a probability distribution over a (finite) universe U. We will often treat /i as a function : 2^ — >■ 

[0, 1]. For T,S CU, we let i_i{T | 5) = P 

"^u-^^i, \^ T \ S\. For singletons u G U, we write interchangeably 
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Hu = n{u) = fi{{u}). Random variables are written in uppercase and fixed values in lowercase. We 
sometimes abuse notation and write a random variable in place of the distribution of that random variable. 

For two distributions fi, v, we let \ii — u\ denote their statistical distance, i.e. \^ — v\ = m.ax.TcuifJ'iT) — 
v{T)). We let D{ii \\ u) = E(/^^[log ^^^] be the relative entropy {i.e. KL-divergence). For two random 
variables X, Y, the mutual information is defined as I{X : Y) = H{X) - H{X \ Y) = H{Y) - H{Y \ X), 
where H{-) is the Shannon entropy. 

A rectangle of A' x 3^ is a product set A x S where A C X and B y. We let R denote a rectangle 
in X X y. We let {x,y) € A! x y denote a fixed input, and {X, Y) be random inputs sampled according to 
some distribution (specified from context and usually denoted by jj.). 

2.2 Information complexity 

We study 2-player communication protocols for calculating a function f -.X ^ Z, where Z A! xy. Let 
vr be a randomized protocol (allowing both public and private coins, unless otherwise specified). We denote 
the randomness used by the protocol vr by r-^. Let 7r(x, y) denote its output, i.e. the value in Z the two parties 
wish to compute. 

The transcript of a protocol includes all messages exchanged, the output of the protocol (in fact we just 
need that both players can compute the output of the protocol from the transcript), as well as any public coins 
(but no private coins). The complexity of vr is the maximum (over all random coins) of the number of bits 
exchanged. 

Let /i be a distribution over x y. Define errj(vr; x, y) = Pr^^ [f{x, y) / 7r(x, y)] if {x, y) ^ I and 
otherwise and err/(vr; ^) = E(x,y)^^err/(vr; X, Y) = Pr,^,(x,y)^^[(X, Y)elA f{X, Y) / Tr{X, Y)]. 

Definition 2.1. Fix /, fi, e. Let {X, Y, U) be the tuple distributed according to {X, Y) sampled from fi and 
then n being the transcript of the protocol vr applied to X, Y. Then define: 

1. ic^(vr) = /(X;n|y) + /(y;n|x) 

2- IC^(/, e) = inf^:errj(,r;M)<e IC^(^) 

3. ICd(/, e) = max^ IC^(/, e) 

Braverman [Bral2] also defined the non-distributional information cost IC, and all of our results extend to 
it trivially by the inequality IC/3 < IC. (We do not require the reverse inequality IC < ©(IC/j), whose proof 
is non-trivial and was given in [Bral2]). 

3 Zero-communication protocols and the relaxed partition bound 

3.1 The zero-communication model and rectangles 

Let us consider a (possibly partial) function /. We say that {x, y) is a valid input if (x, y) G X, that is, 
(x, y) satisfies the promise. In the zero-communication model with abort, the players either output a value 
z & Z (they accept the run) or output _L (they abort). 

Definition 3.1. The zero-communication model with abort is defined as follows: 
Inputs Alice and Bob receive inputs x and y respectively. 

Output Alice outputs a G Z U {_L} and Bob outputs b ^ ZVJ {-L}- If both Alice and Bob output the same 
z ^ Z, then the output is z. Otherwise, the output is _L. 

We will study (public-coin) randomized zero-communication protocols for computing functions in this 
model. 
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3.2 Relaxed partition bound 

The relaxed partition bound with error e and input distribution /i, denoted by p?t(f (/), is defined as follows. 

Definition 3.2. The distributional relaxed partition bound prt^(/) is the value of the following Unear program. 
(The value of z ranges over Z and R over all rectangles, including the empty rectangle.) 

prt^(/) = min — subject to: 

Y ^^^y Yi PRJ'.^^y) + Y ^^^y ^ ^^'^ > (i - e)^ (l) 

(x,y)Gl R:{x,y)eR ix,y)^I z,R:{x,y)<^R. 

y{x,y)exxy, J2 PR,z<V (2) 

z,R:{x,y)£R 

z = 1. (3) 



R,z 

The relaxed partition bound is defined as prt^(/) = max^ prt^(/). 

We can identify feasible solutions to the program in Definition 3.2 as a particular type of randomized 
zero-communication protocol: Alice and Bob sample {R, z) according to the distribution given by the pr^z, 
and each individually sees if their inputs ai^e in R and if so they output z, othei"wise they abort. The parameter 
rj is the efficiency of the protocol [LLR12], that is, the probability that the protocol does not abort, and ideally 
we want it to be as large as possible. 

There is also a natural way to convert any zero-communication protocol vr into a distribution over [R, z): 
sample z uniformly from Z, sample random coins for vr, and \e.t R = Ax Bhe. such that A is the set of 
inputs on which Alice outputs z in the protocol vr using random coins r^r, and similarly for B. (The sampling 
of a random z incurs a loss of |Z| in the efficiency, which is why our bounds have a loss depending on \Z\. 
See Section 3.4 for details.) 

Relation to other bounds. The relaxed partition bound is, as its name implies, a relaxation of the partition 
bound prt^(/) [JKIO]. We also show that the relaxed partition bound is stronger than the smooth rectangle 
bound srecf (/) (the proof is provided in Appendix A). 

Lemma 3.3. For all f, e and z e Z,we have srecf (/) < prt^(/) < prt^(/). 

Since Jain and Klauck have shown in [JKIO] that the smooth rectangle bound is stronger than the rectan- 
gle/corruption bound, the 72 method and the discrepancy method, this implies that the relaxed partition bound 
subsumes all these bounds as well. Therefore, our result implies that all these bounds are also lower bounds 
for information complexity. 

We briefly explain the difference between the relaxed partition bound and the partition bound (more details 
appear in Appendix A). The partition bound includes two types of constraints. The first is a correctness 
constraint: on every input, the output of the protocol should be correct with probability at least (1 — e)r]. The 
second is a completeness constraint: on every input, the efficiency of the protocol (i.e. the probability it does 
not abort) should be exactly rj. In the relaxed partition bound, we keep the same con^ectness constraint. Since 
in certain applications the function is partial (such as the Vector in Subspace Problem [KRll]), one also has 
to handle the inputs where the function is not defined. We make this explicit in our correctness constraint. On 
the other hand, we relax the completeness constraint so that the efficiency may lie anywhere between (1 — e)r] 
and 7]. This relaxation seems to be crucial for our proof of the lower bound on information complexity, since 
we are unable to achieve efficiency exactly t]. 
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3.3 Compression lemma 

Lemma 3.4 (Main compression lemma). There exists a universal constant C such that for all distributions fi, 
communication protocols vr and 6 G (0, 1), there exists a zero-communication protocol vr' and a real number 
A > 2-'^(iCm(^)/'5'+i/'5) such that 

|(X,y,7r(X,y)) - {X,Yy{X,Y)\7:'{X,Y) ^ < 6 (4) 

(in statistical distance) and 

y{x,y) Pr[7r'(x,y) < (1 + 5)A (5) 

''vr' 

Pr [7r'iX,Y)^±]>il-6)X. (6) 

Our compression vr' extends the strategy outlined by [BW12]. At a high level, the protocol vr' does the 
following: 

Sample transcripts Alice and Bob use their shai^ed randomness to repeat T independent executions of an 

experiment to sample transcripts (Protocol 4.1). Alice and Bob each decide whether the experiment is 

accepted (they may differ in their opinions). 
Find common transcript Let A be the set of accepted experiments for Alice, and B the set of accepted 

experiments for Bob. They try to guess an element of ^ n ;B. If they find one, they output according to 

the transcript from this experiment. 

We prove our compression lemma in Section 4. 

3.4 Information cost is lower bounded by the relaxed partition bound 

We show how our compression lemma implies the main theorem. 

Proof of Theorem 1.1. Let vr be a randomized communication protocol achieving IC^(/, e) and let TZ be the 
following relation that naturally arises from the function / 

= {{x, y, f{x, y)) : (x, y) G X} U {{x, y, z) : (x, y) ^I,z€ Z}. 

Let us now consider the zero-communication protocol vr' from Lemma 3.4. As mentioned in Section 3.2, 
there is a natural way to identify vr' with a distribution over labeled rectangles {R, z): sample z uniformly 
from Z, sample and let R = A x B where A is the set of inputs on which Alice outputs z, and similarly 
for B. The sampling of z incurs a loss of\Z\ in the efficiency. 

We make this formal: for any fixed randomness r occurring with probability pr, we define the rectangle 
R{z, r) as the set of (x, y) such that the protocol outputs z, and we let pn^z = '}2t-r=r{z r) Pr/\^\- 

We check the normalization constraint 

R,z ' ' R,z r:R=R{z,r) ' ' r R^z:R.=R{z,r) r 

To see that Equation 2 is satisfied, we have by definition of pr^z that for any {x,y): 

E P^, = ^Pr[vr'(x,y)/±]<il±^. 

\Z\ r^i \Z\ 

z,R:{x,y)eR 
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Finally, to see that Equation 1 is satisfied, we have 

= ^ Pr [{X,Yy{X,Y))en] 

= ^ Pr [7r'{X,Y)^±] Pr [(X, y, 7r'(X, y)) e 7^ | 7r'(X, F) ^ ±] 
> ^(1-5)a( Pr [{X,Y,7T{X,Y))Gn]-6 

- ^(l-^)-^(l-e-'^) > ^A(l-e-2<5) > ^(l-e-3<5) 

where for the last line we used the fact that vr has error e, and so Pr,.^ (x,y)~/i[(^, Y, tt{X, Y)) G 7^] > 1 — e. 
This satisfies the constraints in the linear program (Definition 3.2) for prt^^3^(/) with objective value rj = 

(1 + 5)X/\Z\ > 2-c^(iCmW/5^+i/<5) /\z\. □ 

By the definitions of the information complexity and the relaxed partition bound, we have immediately 

Corollary 3.5. There exists a universal constant C such that for all functions f : I ^ Z, all e,6 £ (0, 1/2), 
we have ICz)(/, e) > §[log prt.^ssif) - log \Z\] - 6. 

4 The zero-communication protocol 

The zero-communication protocol consists of two stages. First, Alice and Bob use their shared random- 
ness to come up with candidate transcripts, based on the a priori information they have on the distribution of 
the transcripts given by the infonnation cost of the protocol. To do this, they run some sampling experiments 
and decide which ones to accept. Second, they use their shared randomness in order to choose an experiment 
that they have both accepted. If anything fails in the course of the protocol, they abort by outputting _L. 

4.1 Single sampling experiment 

The single sampling experiment is described in Protocol 4.1 and appeared first in [BW12] (variants also 
appeared in [Bral2] and [BRll]). Roughly, Protocol 4.1 takes a distribution r and two distributions zv^, ub 
over a universe U such that i^a, i^b are not too far from r and tries to sample an element of U that is close to 
being distributed according to r. 

Let us informally describe the goal of this sampling experiment in our context. Alice knowing x and Bob 
y want to sample transcripts according to Ilx,y which is the distribution over the transcripts of the protocol vr 
applied to (x, y). When inputs x, y are fixed, the probability of a transcript u occurring is the product of the 
probabilities of each bit in the transcript. The product of the probabilities for AUce's bits is some function 
Pa{u) which depends on x and the product of the probabilities for Bob's bits is some function Pb{u) which 
depends on y and 11^ y(it) = Pa{u)pb{u)- Alice can also estimate Pb{u) by taking the average over y of 
Ily{u). Call this estimate qA{u); similarly for Bob's estimate qB{u)- Set = PaQa and ub = QbPb- 

The challenge is that Alice and Bob know only {pA, Qa) and {pB, Qb) respectively and do not know r (in 
our setting, r = ILx,y)- They use a variant of rejection sampling, in which Alice will overestimate qa by a 
factor 2^^; likewise for Bob. Let us define the set of A-bad elements with respect to r, u as follows: 

Ba{t, L') = {ueU\ 2^i/(u) < r(u)}. 

Intuitively, u is bad if r gives much more weight to it than u. Observe that if r = paPb, va = PaQa, then 
u ^ Ba{t, va) implies that 2^qA{u) > Pb{u). 

To prove our compression lemma, we use the following claim about the single samphng experiment. 
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Fix a finite universe U. Let p/\,qA,PB,qB : — > [0, 1] such that r = paPb, >^a ~ PaQa, vb ~ PbQb are all 
probability distributions. 

Alice's input: pA, Qa- Bob's input: pbtQb- Common input: parameter A > 0. 

1. Using public coins, sample u -(^ U, a, f3 -i^ [0, 2^]. 

2. Alice accepts the run if a < pa{u) and (3 < 2^qA{u). 

3. Bob accepts the run if a < 2^qB{u) and (3 < pb{u). 

4. If both Alice and Bob accept, then we say that the experiment is accepted and the output is u. Otherwise, the 
output is ±. 

Protocol 4.1. Single sampling experiment 



Claim 4.2. Let B = B/\(t, up,) U B/\{t, z^b)- Let 7 = t{B). Then the following holds about Protocol 4.1: 

1. The probability that Alice accepts equals and the same for Bob. 

2. The probability that the experiment is accepted is at most |^|^2A at least |^|~^^ . 

3. Let t' denote the distribution of the output of the experiment, conditioned on it being accepted. Then 
\t — t'\ < 7. 

Intuitively, this claim says that Alice accepts each single experiment with probability i^i"^^ , and also 
implies that conditioned on Alice accepting the i'th experiment, it is relatively likely that Bob accepts it. 
Therefore, by repeating this experiment enough times, there is reasonable probability of Alice and Bob both 
accepting the same execution of the experiment. Conditioned on the experiment accepting, the output of the 
experiment is distributed close to the original distribution r. In the next section, we show how to use a hash 
function to select a common accepting execution of the experiment out of many executions. 

We will use the following lemma that appears in [Bral2]. 

Lemma 4.3 ([Bral2]). For all r, u, A, e, it holds that t{B^{t, u)) < -^("^1^)+-^ . 

Proof of Claim 4.2. We use the arguments first given in [BW12] and prove the items in order. 

1. Probability Alice/Bob accepts. We do the analysis for Alice; the case for Bob is entirely symmetric. 
We may write: 



Pr[Alice accepts] = ^ ^^gA(^) = L ^A(n) = p^. 



2. Probability of accepting. First consider u ^ B. For such n, if a < pa{u) then a < 2^qB{u) and also 
if /? < pb{u) then /3 < 2^qj\{u). Therefore we may write 

T3 rc ■ , , , ^ 1 Ph{u)pQ{u) t{u) 

Pr [Experiment outputs u] = — ^ = jj^- (7) 

Furthermore, for any n G ZY, we may write 

1 Tiu] 

Pr[Experiment outputs u] = • min {p^{u), qB{u)2^} ■ min {pq{u) , qf^{u)2^] < 



|^|22A L^av^;,^«v^;- j ...... J - |^|22A ' 
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For the upper bound we have: 



Pv[3u G U, Experiment outputs u] = ^ Pr [Experiment outputs u] < ^ „^ = . 

ueu u&u ' ' ' ' 

For the lower bound we have 

Pr [Experiment outputs u] > Pr [Experiment outputs u] 

t{u) 1 - t{B) 1-7 



3. Statistical closeness of r and r'. Let ?] denote the probability that the experiment is accepted. From 

[ 1-7, 1 



the previous point, we have that rj G [i^tSk^ thwik]- By the definition of statistical distance, it suffices 



to prove that: 

V5 C U, t{S) - t'{S) < 7. 

We proceed by splitting the elements of S based on whether they intersect B. 

r(S) - t'{S) = t{S DB)- t'{S nB) + t{S DB)- r'(5 n B) 

<-f + T{Sr\B)-T'{SnB). 

From Equation 7 we can deduce that t'{S D B) = Therefore: 



1 



r(5 nB)- t'{S nB)= r(5 n B){1 - jj^,. 
Since rj < p^^, we have that (1 — j^^^j^as^) ^ 0» which concludes the proof. 



□ 



4.2 Description and analysis of the zero-communication protocol 

Let fi be any distribution on inputs and vr be any protocol with information complexity / = IC^(7r). Let 
{X, Y, n) be the joint random variables where X, Y are distributed according to and 11 is the distribution 
of the transcript of the protocol vr applied to X, Y (by slight abuse of notation we use the letter IT for both the 
transcript and its distribution). Let n^^^j, be 11 conditioned on X = x,Y = y,llxhell conditioned X = x, 
and Ily likewise. 

Let U be the space of all possible transcripts. We assume that each transcript contains the output of 
the protocol. As shown in [Bral2] and described above, Alice can construct functions p^i Qa '■ ^ ^ [0, 1] 
and Bob can construct functions pB,qB '■ U — )• [0,1], such that for all u ^ U, Yix,y{u) = p/\{u)pb{u), 
Tlx{u) = p^{u)q^{u), and Tly{u) = PB{u)qQ{u). 

The zero-communication protocol vr' is described in Protocol 4.4. This protocol is an extension of the 
one in [BW12], where here Alice uses pubhc coins to guess the hash function value instead of calculating 
and transmitting it to Bob and both players are allowed to abort when they do not believe they can output the 
coiTcct value. 

In order to analyze our protocol, we first define some events and give bounds on their probabilities. 

Definition 4.5. We define the following events over the probability space of sampling (X, Y) according to 
and running vr' on (X, Y) to produce a transcript 11: 
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Alice's input: x. Bob's input: y. Common inputs: S > 0, 1 > 0. 

Set parameters: A = | • + 1) and T = \U\2'^ lii(8/(5) and fc = A + log(f hi{8/S)^). 

1. Alice constructs functions p^, q/\ : U ^ [0, 1] and Bob constructs functions ps, qs, : U [0, 1], such that 
for all transcripts u <eU, \lx,y{u) = pa(m)pb(m), \ix{u) = pi\{u)qi\{u), and n.y(ii) = pB{u)qB{u). 

2. (Run experiments.) Using public coins, Alice and Bob run Protocol 4.1 T independent times with inputs 
PA, gA,PB, qs and A. 

3. Let A = {i £ [T] : Alice accepts experiment i} and similarly B for Bob. If either set is empty, that party 
outputs the abort symbol _L. 

4. (Find intersection.) Using public coins, Alice and Bob choose a random function h : \T] ^ {0, l}'^ and a 
random string r G {0,1}*^. 

(a) Alice finds the smallest i ^ A. If h{i) ^ r then Alice outputs _L. Otherwise, Alice outputs in 
accordance with the transcript of experiment i. 

(b) Bob finds the smallest j G B such that h{j) = r. If no such j exists, he outputs ±. Otherwise, Bob 
outputs in accordance with the transcript of experiment j. 

Protocol 4.4. Zero-communication protocol vr' derived from vr 



1. Large divergence. Bd occurs if {X,Y) = {x,y) such that D{Jlx,y \\ ^x) > ^""g^""'' or Dijlx^y \\ 
Yiy) > ^jjj ^jgQ jgj- denote the set of such {x,y). 

2. Collision. Be occurs if there exist distinct i, j AU B such that h{i) = h{j) = r. 

3. Protocol outputs something. H occurs if tt'{X, Y) ^ _L. 

The proof of the main compression lemma (Lemma 3.4) uses the following claim. 
Claim 4.6. The probability of the above events are bounded as follows: 

1. The inputs rarely have large divergence: Pi'(x,y)~^[-^-D] — ^1^- 

2. For all (x, y), the hash function rarely has a collision: Pr^^, [Be] < ^ • 2"^'^+'^). 

3. For all {x,y) ^ Bd, the probability of outputting something is not too small: Pr^ ,[H] > (1 — 

115\9-(fc+A) 
16 '"^ 

4. For all (x, y) the probability of outputting something is not too large: Pr-r^, [H] < (1 + ^)2^('^"'~^). 

5. For all protocols vr, input distributions fi and S > 0, the protocol vr' in Protocol 4.4 satisfies: For all 
{x,y) ^ Bd, let ^'xyff be the distribution o/7r'(x,y) conditioned on H (namely, on 7r'(x,y) 7^ _Lj. 
Then \U,^y - n^^^^^'l < 35/4. 

Proof. In the following, we will frequently use the fact that for all p, a G [0, 1] , it holds that p{l — a) > p — a. 
We extend the arguments given in [BW12] to prove the items of the claim in order. 

1. By the definition of information complexity and the fact that mutual information is equal to the expec- 
tation of the divergence, we have that for {X, Y) distributed according to fi, 

IC^(7r) = I{X;U I Y) + I{Y;U \ X) = E(,, ,y)^(x,y) [Z?(n,,,y || Hy) + D{Ii.,^y \\ H,)]. 

This implies that E(2,. ,;)<_(x,y) [-^^(nx,?/ || n^^)] < I C^(7r), and since divergence is non-negative we have 
by Markov's inequality that 

Pr [Z?(n,,, II n,.) > 8IC^(7r)/<^] < 5/8. 

{x,y)^{X,Y) 

The same argument holds for D(Jlx^y \\ liy) and by a union bound, we have Pr(x,y)~^[-BD] < 5/4. 
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2. We may write: 

Fv[Bc] 

= Fr[3i / j G [T] s.t. ie{AuB),j e {AuB),h{i) = h{j) 
< Pr[iG{ALlB)Aje{AuB)Ah{i) = h{j)=r] 

= Pr[i G U B)] Pr[j e{AU B)] Pr[/i(i) = h{j) = r] 



< T 



2 



4 



(|ZY|2^)2 22^ 



^ 5 9-(fc+A) 
- 16 ■ ^ 

where we have used the independence between the trials and independence of the h from the trials, as 
well as Item 1 of Claim 4.2. 

3. Let us define G to be the event that the smallest i ^ A satisfies h{i) = r, and also i ^ B. (Notice this 
implies that are both non-empty.) We have 

Pr[G] = Pr[^ / 0] • Pr[G \A^0]. 

Observe that an element z is in ^ if and only if experiment i is accepted by Alice. By Item 1 of 
Claim 4.2, the probability of Alice aborting each experiment i is 1 — j^^j^- Since the experiments are 
independent, the probability of Alice aborting all experiments is 

Pr[^ = 0] = (l - -LS < < 5/8. 



We assume now that A is non empty and we denote by i its first element. 

Pr[G \A^0] 

= Pr[/i(i) = r\ Ay^ 0] Pr[i e B\i e AA h{i) = r AA^^ 0]. 

For all j, the probability that h{j) = r is exactly 2~^, in particular this holds for the first element of A. 

For any (x, y) ^ Bd, we have that Dijlx^y \\ Hx) < 8IC^(7r)/(5. Let us say that a transcript is "bad for 
Alice" (resp. Bob) if it lies in the set B^illx^y, lix) (resp. in the set B^^ijlx^y, ^y)- Using Lemma 4.3, 
this implies that 

, flC.M + l 
-fA = Pr[n^,j/ bad for AUce] < ^ < 5/4 

7B = Pr[nx,y bad for Bob] < 5 /A. 

It follows that 7 = Vi\n.x,y bad for Alice or Bob] < 7a + 7b < 5/2. 

By definition, for any j, experiment j is accepted if and only ii j ^ Ar\B. Therefore, Vj G [T]: 

Pr[j ^ B \ j ^ A\ = Pr [experiment j is accepted | j G ^] 
Pr [experiment j is accepted] 



Pr[j G A] 

1-7 
2^ 



> 



1- ^ 

> 2 
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where we used Item 1 and Item 2 of Claim 4.2, and the fact that -^Bo implies that 7 < 5/4. 

Also, observe that by the definition of the protocol, the choice of h and r are completely independent 
of the experiments. Therefore we may add the condition that h{j) = r without altering the probability. 
Since j ^ A implies ^ 7^ 0, we can add this condition too. We use this with j = i, so therefore we 
may write 

1 _ i 1 _ M 
Pr[G]>(l-5/8)^>^. 

Finally, observe that H \ Be = G \ Bq, therefore we may conclude that: 

Pr[i?] > Y>y[H \ Be] = Pr[G \ Be] > Pr[G] - Y>i[Bc] > (1 - ^)2-^'-^. 

4. We will again use the event G as defined in the previous point. We will again use the fact that: 

Pr[Gl = PrU / 0] Pr[/i(i) = r M / 0] Pr[i G B U G ^ A hii) = r NA-^0] 

< Pr[/i(f) = r\A^0] Pr[i eB\ieAA h{i) = r NAi^Oi]. 

As before the first factor is exactly 2^^ for any i. We may also write: 

Pr[i G 5 I i G ^] = Pr [experiment i is accepted | f G ^] 
Pr [experiment i is accepted] 
~ Pr[f G ^] 

1 

- 2^' 

where we used Item 1 and Item 2 of Claim 4.2. As with the previous point, adding the conditions 
h(i) = r and A does not affect the probabilities. Therefore, Pr[G] < 2~^^^. Finally, observe 
that i7 C G U ^c, and therefore: 

Vt[H] < Pr[G U Be] < Pr[G] + VT[Be] < (1 + ^)2~'^-^. 

5. The distribution of 11^ ^ conditioned on not aborting and on no collision is simply the distribution of 
the output of a single experiment, and we know from the facts about the single experiment (Claim 4.2) 
that this is close to Tlx.y. We wish to conclude that 11^ y conditioned only on not aborting is also close 
to n^. y. The following lemma allows us to do this by using the fact that the probability of collision is 
small: 

Claim 4.7. Let IT and 11' two distributions taking output in a common universe. Let F and H be 
two events in the underlying probability space of XT'. Finally, we let 11^ denote the distribution IT' 
conditioned on E = H\F, and assume that [11^ — n| < c. Then it holds that [11^ — n| < c + p^"'|^| . 

Proof. For shorthand, for any event E let us write 11'^ (E') = Prn'[^ I H], and similarly for n'(£^) 
and n(£'). For a set 5 in the support of H' and 11, we let S also denote the event that the value of the 
random variable is in S. 
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It suffices to prove that, for all subsets S in the union of the supports of H and 11'^, it holds that 
U'^iS) - U{S) < c + p'g[j^] . To show this, we may write: 

u'^iS) - uis) = - n(5) 

^ u'{EnS) + u'{{H\E)nS) 

- inH) ^^^'^^ 

^ U'iE)ic + U{S))+U'{F) 

- iFiH) 

since {H \ E) F and jn'^. — n| < c. Using the fact that E d H ,\Ne. can conclude that: 

n^(5)-n(5)<c+^ = c+p^. 

□ 

We apply this lemma with 11 = Tix,y, H' = 11^ y, = Bq, and the event that 7r'(x, y) / ±. We 
note that = \ = G \ 5c. We calculate c, Pr^^, [F] and Pr,.^, [H]: 

• Since /i and r are completely independent of the actual experiments themselves, it holds that 
the distribution of 11^ conditioned on G \ Be is identical to the output of a single experiment 
(Protocol 4.1). Since {x,y) ^ Bu, the measure of the bad set B^^iJlx^y^Iiy) U B^(Jlx,y,^x) is 
bounded by 5/2. We apply Item 3 of Claim 4.2 to deduce that \Tlx,y ~'^xy e\ < <5/2 = c. 

• From Item 2 of Claim 4.6, we know that Pr^^, [F] = Pr,.^, [Bc\ < ^ • 2~^''+^\ 

• From Item 3 of Claim 4.6, we know Pr^^, [H] > (1 - i^)2-('=+^) because {x, y) ^ Bd- 

Therefore, Claim 4.7 implies that: 

in.,, - K,y,^A < V2 + ^ n^^ ^ V2 + 6/5 < 3d/A 

-Lou 16 ) 



where we use the assumption that 6 < 1. 



□ 



4.3 Proof of the Compression Lemma 

Proof of Lemma 3.4. Set A = 2-^''+^). It holds that A > 2-C'(ic^(7r)/52+i/5) f^j. q ^ 54 ^ ^j^y 
subset of the support of {X, Y, 7r(X, Y)). Then 

Pr [{X,Y,n{X,Y))€n]< Pr [Bol+^Pr [^Bd]- Pr [{X,Y,7r{X,Y)) € TZ \ ^B^]. 

Applying Item 5 of Claim 4.6 and the fact that TZ is simply an event, it follows that for all {x,y) ^ Bd 
Pr[(x, y, 7r{x, y)) G 7^] < Pr[(x, y, 7r'{x, y)) e TZ \ 7r'{x, y) ^ ±] + f . 

Since Vt[Bd] < 5/ A (Item 1 of Claim 4.6), 

Pr [(X,y,7^(X,y))G7^]< Pr [(X, F, 7r'(X, Y)) G 7^ | ^'(x, / ±] + 5. 

r^,(X,y)~At r^,,(X,y)--/i 



13 



This proves one direction of Equation 4. For the other direction, we have that 



Pr[(x,y,7r'(x,y)) G n \ ^'(x,y) / ±] 

< Vt[Bd] + Prh^D] • Pr[(X, Y, 7r'(X, Y))) G 7^ | -S^, 7r'(X, Y) / ±] 

< I + V^[-^Bd] ■ (Pr[(X, y, 7r(X, y )) G 7^ I -Sd] + f ) 

< Pr[(x, y, ^(x, y)) G 7^ A ^Bd] + s 

< Pr[(x,y,^(x,y)) en] + 6 



(8) 



(9) 



where in Equation 9 we applied Item 5 of Claim 4.6. This proves Equation 4 of Lemma 3.4. 
Equation 5 follows immediately from Item 4 of Claim 4.6. 
Finally, for Equation 6, we may write: 




where we used Item 3 of Claim 4.6. 



□ 



5 Applications 

We can prove lower bounds on the information complexity of specific problems, by checking that their 
communication lower bounds were obtained by one of the methods subsumed by the relaxed partition bound, 
including the factorization norm, smooth rectangle, rectangle, or discrepancy. However, a bit of care is re- 
quired to ascertain this. For example, while a paper may say it uses the "rectangle bound", we must still verify 
that the value of the linear program for p?t (or one of the subsumed programs such as srec or rec) is at least 
the claimed bound, since different authors may use the term "rectangle bound" to mean different things. In 
particular what they call "rectangle bound" may not satisfy the constraints of the rectangle/smooth rectangle 
linear programs given by Jain and Klauck [JKIO]. After we have verified that prt is appropriately bounded, 
then we can apply our main theorem (Theorem 1.1). We do this for the problems below. 

5.1 Exponential separation of quantum communication and classical information complexity 

We prove that the quantum communication complexity of the Vector in Subspace Problem is exponentially 
smaller than its classical information complexity (Theorem 1.2). In the Vector in Subspace Problem VSPo,n> 
Alice is given an n/2 dimensional subspace of an n dimensional space over M, and Bob is given a vector. 
This is a partial function, and the promise is that either Bob's vector lies in the subspace, in which case the 
function evaluates to 1, or it lies in the orthogonal subspace, in which case the function evaluates to 0. Note 
that the input set of VSPo,n is continuous, but it can be discretized by rounding, which leads to the problem 
YSPe^n (see [KRll] for details). 

Klartag and Regev [KRll] show that the Vector in Subspace Problem can be solved with an O(logn) 
quantum protocol, but the randomized communication complexity of this problem is 0(n^/^). Their lower 
bound uses a modified version of the rectangle bound, which can be shown to be still weaker than our relaxed 
partition bound. 

Lemma 5.1. There exist universal constants C and 7 such that for any e, 



p"rt,(VSP^,,„) > -(0.8 - 2.8e) exp(7n 




Klartag and Regev's lower bound is based on the following lemma: 
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Lemma 5.2 ([KRl 1]). Let f = VSPo,n. ^ (^nd y denote Alice and Bob's input sets for f, a be the uniform 
distribution over A! x y and ab be the uniform distribution over f~^{h). There exist universal constants C 
and 7 such that, for any rectangle R and any b G {0, 1}, we have: 



abiR n f'\b)) > 0.8a{R) - Cexp(- 



■772 



1/3N 



We show that this implies the required lower bound on the relaxed partition bound of VSP^.n- 

Proof of Lemma 5.L Let us first consider / = VSPo,n and show that log prt^(VSPo,n) = Note that 

since the input set of VSPo,n is continuous, we need to extend the definition of the relaxed partition bound to 
the case of a continuous input set, but this follows naturally by replacing summation symbols by integrals. 

Let ^ = ^a±^, which satisfies = 1, i.e. fi only has support on valid inputs. Then, Lemma 5.2 

implies that for any rectangle R and any b € {0, 1}, we have 



2n{R n f^^{b © 1)) > 0.8a{R) - Cexp( 



-771, 



l/3^ 



Since only has support on / ^ and / is Boolean, we have fJ.{R) = ^{R n / ^(6)) + ijl{R H f ^(6 ® 1)), 
so that the inequality can be rewritten as 

2n{R) - 2fi{R n f-^{b)) > 0.8a{R) - C exp{--fn^/^). 

Setting t = 2hEt^lA^ we have that: 

t ■ {2^i{R n f~^{b)) + 0.8a{R) - 2^{R)) < 1. 

We now construct a feasible point for the dual formulation of prt^(/) given in Claim A.2 by setting 

a{x,y) = 2tii{x,y) for {x,y) G f^^ and a{x,y) = 0.8ta{x,y) otherwise; and P{x,y) = 2t^{x,y) — 
0.8ta{x,y) for {x,y) G /~^, and P{x,y) = otherwise. Note that for {x,y) G /^^, we have fi{x,y) = 
a{x, y)/o'{f~^) > a{x, y), hence all these values are positive. By the previous inequality, we also have 



a{x,y)dxdy + / a{x,y)dxdy — / (3{x,y)dxdy < 1 

Rnf-'^{b) JR\f-^ Jr 

for any R, b, therefore this is a valid feasible point. Moreover, the corresponding objective value is 



{1 — e)a{x,y)dxdy — j P{x,y)dxdy 

= t ((1 - e)2^(/-i) + (1 - e)0.8(l - a{f-^)) - 2^Ji{f-^) + OMr')) 
= t{0.8 - 2.8e + 0.8e(7(/~^)) 
> t{0.8 - 2.8e). 

Finally, let us note that we can construct a zero-communication protocol for VSPo,n by first rounding off the 
inputs, and then applying a zero-communication protocol for VSPg^^. This means that we can turn a feasible 
point for the primal form of prt^(VSPg „) (given in Claim A.2) into a feasible point for p?t^(VSPo,n) with 
the same objective value, so that p?t^(VSPe,n) > prte(VSPo,ri) > t{0.8 - 2.8e). □ 

Finally, Lemma 5. 1 together with Theorem 1.1 implies that IC(VSPe^„, e) = and also Theorem 1.2.| 

This allows us to conclude that the information complexity of this function is at least This solves 

Bravemian's Open Problem 3 (Are there problems for which Q{f,e) = 0(polylog(IC(/, e)))?) and Open 
Problem 7 (is it true that IC(VSP9,„, 1/3) = n^^^)?) 

Moreover, our result implies an exponential separation between classical and quantum information com- 
plexity. We refrain from defining quantum infomiation cost and complexity in this paper (see [JNIO] for a 
definition), but since the quantum information complexity is always smaller than the quantum communication 
complexity, the separation follows trivially from Theorem 1.2. 
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5.2 Information complexity of the Gap Hamming Distance Problem 

We prove that the information complexity of Gap Hamming Distance is Q{n) (Theorem 1.3; Open Prob- 
lem 6 in [Bral2]). In the Gap Hamming Distance Problem (GHD,„), Alice and Bob each receive a string 
of length n and they need to determine whether their Hamming distance is at least n/2 + ^/n or less than 
n/2 — ^Jn. We prove that the information complexity of Gap Hamming Distance is £l{n) (Theorem 1.3; 
Open Problem 6 in [Bral2]). The communication complexity of Gap Hamming Distance was shown to be 
by Chakrabarti and Regev [CRll]. The proof was subsequently simplified by Vidick [Vidl2] and 
Sherstov [Shel2]. The first two proofs use the smooth rectangle bound, while Sherstov uses the rectan- 
gle/corruption bound. 

The corruption bound used by Sherstov is a slight refinement of the rectangle bound as defined by Jain 
and Klauck [JKIO], since it can handle distributions that put small weight on the set of inputs that map to 
some function value z. It can be shown that this bound is weaker than our relaxed partition bound, which 
implies Theorem 1.3. 

Lemma 5.3. There exist universal constants C and 5 such that for any small enough e, prt^(GHD) > (72*^". 

We let the output set he Z = { — 1, 1} rather than bits, to follow the notation of [Shel2]. Let us recall the 
corruption bound used by Sherstov. 

Theorem 5.4. For any function f with output set Z = { — 1, 1}, and €,6, f3 > if a distribution on the inputs 
fj, is such that 

then 

2^^(^) > ^(KrH-i)) - 

We can derive the general corruption bound as used by Sherstov by giving a feasible solution to the dual 
of the linear program by Jain and Klauck. In the dual form, recf(/) is defined for z £ Z as 

recf(/) = max(l-e) ^ - e ^ ax,y (10) 

""■""^ ix,y)ef-Hz) {x,y)ef-^\f-H^) 

\/R, Yl "^-^y- Yl C,x,y<l. (11) 

{x,y)eRnf-Hz) (x,y)GiJn(/-i\/-i(2)) 

For the remainder of this section fix z = — 1. Consider the following assignment for reef, where fi is the 
distribution over the inputs in the theorem. Letting ax,y = ^^(x, y) if f{x, y) = —1, and ax^y = j^ii{x, y) 
if /(x, y) = 1, we can verify the constraints and the objective value is greater than the corruption bound. 

To conclude the bound on GHD, Sherstov gives a reduction to the Gap Orthogonality Problem (ORT) 
and proves the following lemma. 

Lemma 5.5 ( [Shel2]). Let f denote the Gap Orthogonality Problem. For a small enough constant 5 < 1 
and for the uniform distribution jj, on inputs, and any rectangle R such that IJ,{R) > 2~^^, ii{R n /~^(1)) > 
6n{R). Furthermore, n{f-^{-l)) = 9(1). 

Putting all this together, we have that reef (ORT) > C2^"' for appropriate choices of e. 

Finally, there is a simple reduction from one instance of ORT to two instances of GHD (see [Shel2] for 
details). This means that [p?t^/2(GHD)]^ > p?t^(ORT), since given any zero-communication protocol for 
GHD with eiTor e/2 and efficiency rj, one can give a zero-communication protocol for ORT in the obvious 
way, by simply running the protocol for GHD twice and the reduction to determine the output for ORT. By 
a union bound this incurs error e, and it has efficiency t]^ (the probability that the two independent calls to 
the GHD protocol both do not abort). Therefore we may conclude that [p?t^/2(GHD)]2 > p?t^(ORT) > 
reef (ORT) > C2^" . 
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6 Conclusions and open problems 

We have shown that the infomiation complexity is lower bounded by a relaxed version of the partition 
bound. This subsumes all known algebraic and rectangle based methods, except the partition bound. It 
remains to be seen if the partition bound also provides a lower bound on the information complexity. Alterna- 
tively, if we would like to separate the communication and information complexities, then possible candidates 
could be functions whose partition bound is strictly larger than their relaxed partition bound. 

Moreover, we have seen how the relaxed partition bound naturally relates to zero-communication pro- 
tocols with abort. Actually, we can relate all other lower bound methods to different variants of zero- 
communication protocols [DKLRll, LLR12]. This provides new insight on the inherent differences be- 
tween these bounds and may lead to new lower bound methods, coming from different versions of zero- 
communication protocols. Moreover, since these protocols have been extensively studied in the field of quan- 
tum information, it is intriguing to see what other powerful tools can be transferred to the model of classical 
communication complexity. 
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A Relaxed partition bound and other bounds 

Recall the definition of the partition bound prt^(/) in [JKIO]: 
Definition A.l ([JKIO]). The partition bound prt^(/) is defined as the value of the following linear program: 

prt,(/) = min Vwij,^ subject to: V(x,y)GX, V WRj^^^^y) >l - e 

' ~ R,z R:(x,y)&R 

y{x,y)£Xxy, WR,, = l. 

z,R:{x,y)eR 

(To put this into a form similar- to Definition 3.2, simply substitute ^ = J2r z '^R,z and pR^z = wr^zV-) 
We first show that prt^(/) is indeed a relaxation of the partition bound by providing linear programming 
formulations for prt^(/) and prt^(/). 

Claim A.l. p?t^(/) can be expressed as the following linear programs: 
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1. Primal form: 



Prt^(/) = mill y^WR^^ subject to: 



R,z 



2. Dual form: 

P"<(/) 



l^x,y WRJ(x,y) + Yl ^^'•'■y ^^^^ > 1 - e 

{x,y)el R:{x,y)eB. {x,y)^X z,R:{x,y)€R 

z,R:{x,y)eR 



max (1 — e)a — > 8^. v subject to: 

{x,y)£Xxy 

(x,y)eiJn/-i{z) ix,y)eR\x {x,y)eR 



Similarly, prt^{f) is given by the value of the following linear programs: 
1. Primal form: 



prt,(/) = minV 

R.z 



subject to: V(x, y) G X, Y '^RJ{x,y) > 1 " « 

R:{x,y)£R 

V(x, y) iX, Y ^R'^ > 1 - e 

^,ii:(x,j/)G_R 

z,R:{x,y)eR 



2. Dual form: 



Prt.(/) 



max 



E (1 

(x,y)GXxy 

yR,z, Y 



{x,y)exxy 



a 



x,y 



(x,y)e-Rn/-i{^) 



{x,y)&R\I 



{x,y)eR 



Proof The primal form of prt^(/) can be obtained from Definition 3.2 by using the change of variables 
wr^z = PR,z/f]- The dual form then follows from standard linear programming duality. 

As for prt^(/), by Definition 3.2 we have prt^(/) = max^ prt^(/). The dual form of prt^(/) then follows 
from the dual form of p?t^ via the change of variables a-x^y = afix,y Finally, the primal form can be obtained 
via standard linear programming duality. □ 

Let us compare the primal form of p?t^(/) with the definition of the partition bound prtg(/). We see that 
the first two constraints in pTt^(/) imply that J2z r-{x y)eR. '^R,z li^^ between 1 — e and 1, while for prt^(/), 
this should be exactly equal to 1. In terms of zero-communication protocols, this difference can be interpreted 
as follows: for prt^(/), the protocol should output anything but _L with constant probability r] for any input 
(x, y), while for prt^(/), the probability of not outputting ± is allowed to fluctuate between (1 — e)7] and r]. 

Just as the partition bound, the relaxed partition bound is stronger than the smooth rectangle bound, 
defined in [JKIO] as foUows: 
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Definition A.3 ([JKIO]). The smooth rectangle bound srecf" (/) is the value of the following lineai^ program: 
srecf (/) = mill V w;^ subject to: V(x,y) G /"^^o), Y] > 1 - e 

^- R R:(x,y)&R 

R:{x,y)&R 

y{x,y)eI\r\zo), Yl ^fl^^- 

R:{x,y)eR 

Let us now prove Lemma 3.3, that is, srecf''(/) < pTt^(/) < prt^(/). 

Proof of Lemma 3.3. The second inequality is immediate since the only difference between the linear pro- 
grams is that the constraint Y^_, p,.^^ ^-j^^ wr^z = 1 in the primal form of prt^(/) has been relaxed to 1 — e < 

Y,z,R:{x,y)&R'^R,z < 1- 

As for the first inequality, let wr^z be an optimal solution for the primal formulation of prt^(/) in 
Claim A.2. Then, it is straightforward to check that setting w'^ = wr^zq leads to a feasible point for srecf (/), 
with objective value w'j^ = Eij '^R,zo < T,r,z ^R,z = Prte(/)- ^ 
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